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Abstract. We consider a model of a random copolymer at a selective interface which 
undergoes a localization/delocalization transition. In spite of the several rigorous results 
available for this model, the theoretical characterization of the phase transition has re- 
mained elusive and there is still no agreement about several important issues, for example 
the behavior of the polymer near the phase transition line. From a rigorous viewpoint 
non coinciding upper and lower bounds on the critical line are known. 

In this paper we combine numerical computations with rigorous arguments to get to 
a better understanding of the phase diagram. Our main results include: 

- Various numerical observations that suggest that the critical line lies strictly in between 
the two bounds. 

- A rigorous statistical test based on concentration inequalities and super-additivity, for 
determining whether a given point of the phase diagram is in the localized phase. This is 
applied in particular to show that, with a very low level of error, the lower bound does 
not coincide with the critical line. 

- An analysis of the precise asymptotic behavior of the partition function in the de- 
localized phase, with particular attention to the effect of rare atypical stretches in the 
disorder sequence and on whether or not in the delocalized regime the polymer path has 
a Brownian scaling. 

- A new proof of the lower bound on the critical line. This proof relies on a characteri- 
zation of the localized regime which is more appealing for interpreting the numerical data. 

Keywords: Disordered Models, Copolymers, Localization Transition, Large Deviations, 
Corrections to Laplace estimates, Concentration of Measure, Transfer Matrix Approach, 
Statistical Tests 
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1. Introduction 

1.1. The model. Let S = {S n } n =o,i t ... be a random walk with So = and S n = Y^j=i Xj, 
{Xj}j a sequence of IID random variables and P (X\ = 1) = P (X\ = —1) = 1/2. For 
A > 0, h > 0, N G 2N and w = {wj}j=i,2,... G we introduce the probability measure 
P 7vt denned by 

dP X ' h 1 / N \ 

—^(S) = exp A^(u; n + / i )sign(5 n ) , (1.1) 

Z N,uj \ n=l J 

where Z^f 1 ^ is the partition function and sign(S2n) is set to be equal to sign(5 , 2 n -i) for 
any n such that S^n = 0. This is a natural choice, as it is explained in the caption of Fig.^ 

For what concerns the charges uj we put ourselves in a quenched set-up: a; is a typical 
realization of an IID sequence of random variables (we denote by P its law). We suppose 
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Figure 1. The process we have introduced is a model for a non-homogeneous polymer, 
or copolymer, near an interface, the horizontal axis, between two selective solvents, say 
oil (white) and water (grey) . In the drawing the monomer junctions are the small black 
rounds and the monomers are the bonds of the random walk. The big round in the middle 
of each monomer gives the sign of the charge (white = positive charge = hydrophobic 
monomer, black = negative charge = hydrophilic monomer). When h > water is the 
unfavorable solvent and the question is whether the polymer is delocalized in oil or if 
it is still more profitable to place a large number of monomers in the preferred solvent, 
leading in such a way to the localization at the interface phenomenon. The conventional 
choice of sign(O) we have made reflects the fact that the charge is assigned to bonds 
rather than points. 



that 

M(a) := E [exp (aui)] < oo , 
for every a and that E [u>i] = 0. Moreover we fix E[u;i 2 ] = 1. 



X2) 



1.2. The free energy and the phase diagram. We introduce the free energy of the 
system 

/(A '^ = A) hn oo^ 1 ° g ^i- (L3) 

The limit has to be understood in the P ( du;)-almost sure sense, or in the Lj (P) sense, and 
/(A, h) does not depend on u. A proof of the existence of such a limit goes along a standard 
superadditive argument and we refer to ^1] for the details, see however § 12.11 below. By 
convexity arguments one easily sees that the free energy is a continuous function. 
We observe that 

/(A, h) > Xh. (1.4) 
In fact if we set 0,^ = {S : S n > for n = 1, 2, . . . , iV} 

/ n N 
exp A (u n + h) 



— log Z^ h > — log E 



n=l 



N 



N 1 

K + h) + - log P {n+ ) ^5° Ah, (1.5) 



n=l 



where the limit has to be understood in the P( dw)-almost sure sense: notice that we have 
used the law of large numbers. We have of course also applied the well known fact that 
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P behaves like N' 1 / 2 for N large QH Ch. III]. In view of jOJ and of (JT3J) we 

partition the phase diagram in the following way: 

• The localized region: C = {(A, h) : /(A, h) — Xh > 0}; 

• The delocalized region: T> = {(A, h) : /(A, /i) — A/i = 0}. 

This phase diagram decomposition does correspond to different behaviors of the trajec- 
tories of the copolymer: we will come back to this important issue in § 11.41 

We sum up in the following theorem what is known about the phase diagram of the 
model. 

Theorem 1.1. There exists a continuous increasing function h c : [0, oo) — ► [0,oo) ; 
h c (0) = 0, such that 

C = {(X,h) : h < h c (X)} and V = {(A, h) : h > h c {\)} . (1.6) 

Moreover 

- (A) := 4x73 log M ( ~ 4A/3) - hc{x) - k log M ( ~ 2A) =: ^ (A) - (L7) 

This implies that the slope at the origin belongs to [2/3,1], in the sense that the inferior 
limit of h c (X) / 'A as A \ is not smaller than 2/3 and the superior limit is not larger than 
1. 



Remark 1.2. In ^S] it is proven that the limit of h c {\)/\ as A \ does exist and it is 
independent of the distribution of cji, at least when uj% is a bounded symmetric random 
variable or when u% is a standard Gaussian variable 15_ . This universal character of the 
slope at the origin makes this quantity very interesting. 

Theorem 11.11 is a mild generalization of the results proven in |5j and [3] : the extension 
lies in the fact that u\ is not necessarily symmetric and a proof of it requires minimal 
changes with respect to the arguments in PJ. The lower bound in (|1.7j) is actually proven 
explicitly in Appendix El (see also Section 0J), but we stress that we present this proof 
because it is a new one and because it gives some insight on the computational results. 
For what follows we set 

h {m) (X) = ^L. log M (-2mA), (1.8) 

for m > 0. Observe that the curves h(-) and h(-) defined in ()1.7|) correspond respectively 
to m = 2/3 and m = 1, and that -^/i (m) (A)| A =o = m. 



Remark 1.3. Notice that one can write 

-0 s ) = ^br ex P ( " 2A Yl ( w ™ + h ) A n), (1.9) 



a~p x ' h 1 / v 

"dp -1 ] ~ ~z> 

A N,u; \ n=l 



with A n = (1 — sign(S' n )) /2 and Z^^ := a new partition function which coincides 
with Zjsr t w exp ^— A Y^n=i( u n + h)\ and therefore we have 

F(X,h) := lim L\ og Z N ^ = f(X,h)-Xh. (1.10) 

N—*oo JV 

This limit of course has to be interpreted in the P(du;)-a.s. sense. We stress that, even 
if equivalent to Zm,uj exp(-XhN) in the Laplace asymptotic sense, Z^^ turns out to be 
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substantially more useful. This had been already realized in jHj, but for our results looking 
at Zn iUJ , rather than Z^^, is even more essential. Moreover from now on f(A, h), rather 
than /(A, h), will be for us the free energy. 



We will use repeatedly also the partition function associated to the model pinned at 
the right endpoint: 

exp -2A ^2(uj n + h) A n \ ; S N = x 



n=l 



It is worth recalling that one can substitute Z^ h u with Z^'^(x), any fixed even x (recall 
that N S 2N), in (|1.1U() and the limit is unchanged, see e.g. jS] or [Tlj . 

1.3. A random walk excursions viewpoint. We present here a different viewpoint 
on the process: this turns out to be useful for the intuition and it will be used in some 
technical steps. 

We call J] the first return time of the walk S to 0, that is rj := inf {n > 1 : S n = 0}, and 
set K{2n) := P (jj = 2n) for n € N. It is well known that K (■) is decreasing on the even 
natural numbers and 

lim x 3 / 2 K(x) = y/2fr, (1.12) 

see e.g. |121 Ch. 3]. Let the IID sequence {??j} J=1 2 denote the inter-arrival times at for 
S, and we set Tfc :=% + •••+ Vk- If we introduce also £jy = max{j E N U {0} : Tj < N}, 
then by exploiting the up-down symmetry of the excursions of S we directly obtain 



Zn,w{0) = E Y[<p A Un + Xhrij \-Te N =N 

j=l \ n=Tj-i+l J 
N X / Xi 



(1.13) 



\\ f A ^ u n + Xh(xi - x^x) \K(xi-Xi-i) 



1=0 xo,...,x ; G2N i=l \ n=Xi_i+l 
0=:x <...<xi:=N 



with (p(t) := (1 + exp(— 2t)) /2. Of course the formula for Zjv,w is just slightly different. 

Formula (|1.13|) reflects the fact that what really matters for the copolymer are the 
return times to the interface. 

1.4. Known and conjectured path properties. The question of whether splitting the 
phase diagram into the regions C and T> does correspond to really different path behaviors 
has a positive answer, at least if we do not consider the critical case, that is if we consider 
the path behavior for (A, h) £ C and for (A, h) in the interior of T>. However, while the 
localized regime is rather well understood, the delocalized one remains somewhat elusive 
(we take up this point again in Section f4.1j) . More precisely: 

• For (A, h) G C one knows that the polymer is going to stay very close to the 
interface, essentially at distance 0(1) and the polymer becomes positive recurrent 
for N — » oo. Due to the disordered distribution of the charges, even the most 
elementary results in this direction require a careful formulation and we prefer to 
refer to Q], [21 and jUJ. 

• For (A, h) in the interior of T> one can prove by large deviation arguments that there 
are o(N) visits to the unfavorable solvent and by more sophisticate arguments 
that these visits are actually O(logiV) 15 j. These results are in sharp contrast 
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with what happens in C and in this sense they are satisfactory. However they give 
at the same time still a weak information on the paths, above all if compared to 
what is available for non disordered models, see e.g. (El and references 

therein, namely Brownian scaling, which in turn is a consequence of the fact that 
all the visits in the unfavorable solvent happen very close to the boundary points, 
that is the origin, under the measure Pj} u - In non disordered models one can in 
fact prove that the polymer becomes transient and that it visits the unfavorable 
solvent, or any point below a fixed level, only a finite number of times. Recently it 
has been shown ^3] that such a result cannot hold as stated, at least for h < h(X), 
for the disordered copolymer. However the results in |15j leave open the possibility 
of Brownian scaling in the whole delocalized region. 



1.5. Outline of the results. Formula (|1.7[) leaves an important gap, that hides the 
only partial understanding of the nature of this delocalization/localization transition. Our 
purpose is to go toward filling this gap: our results are both of theoretical and numerical 
nature. At the same time we address the derealization issues raised in § 11.41 which are 
intimately related with the precise asymptotic behavior of Zn,u and of Zn iU (0). More 
precisely: 

(1) In Section [21 we present a statistical test with explicit error bounds, see Proposi- 
tion based on super-additivity and concentration inequalities, to state that a 
point (A, h) is localized. We apply this test to show that, with a very low level of 
error, the lower bound h = h(X) defined in (|1.7f) does not coincide with the critical 
line. 

(2) In Section|31we give the outline of a new proof of the main result in 3 . The details 
of the proof are in Appendix B and we point out in particular Proposition IB. 21 
that gives a necessary and sufficient condition for localization. This viewpoint on 
the transition, derived from |15| Section 4], helps substantially in interpreting the 
irregularities in the behavior of {Zj^ )U1 } N as N f oo. 

(3) In Section |1] we pick up the conjecture of Brownian scaling in the delocalized 
regime both in the intent of testing it and in trying to asses with reasonable 
confidence that (A, h) is in the interior of T>. In particular, we present quantitative 
evidences in favor of the fact that the upper bound h = /i(A) defined in (|1.7|) is 
strictly greater than the critical line. We stress that this is a very delicate issue, 
since derealization, unlike localization, does not appear to be reducible to a finite 
volume issue. 

(4) Finally, in Section we report the results of a numerical attempt to determine the 
critical curve. While this issue has to be treated with care, mostly for the reasons 
raised in point 4 above, we observe a surprising phenomenon: the critical curve 
appears to be very close to h^ m \-) for a suitable value of m. By the universality 
result proven in [T3J , building on the free energy Brownian scaling result proven in 
0, the slope at the origin of h c {-) does not depend on the law of u. Therefore if 
really h^ m \-) = h c {-), since the slope at the origin of h( m H-) is m, m is the universal 
constant we are looking for. We do not believe that the numerical evidence allows 
to make a clear cut statement, but what we observe is compatible with such a 
possibility. 
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We point out that our numerical results are based on a numerical computation of the 
partition function Znw, exploiting the standard transfer-matrix approach (this item is 
discussed in more details in Appendix ^J. 

1.6. A quick overview of the literature. The copolymer in the proximity of an inter- 
face problem has a long literature, but possibly the first article that attracted the attention 
of mathematicians is |16j . Here we are going to focus on very specific issues and the most 
interesting for our purposes is that in the physical literature both the conjecture that 
h{-) = h c (-) (cf. [201 and US!) and that K') = (cf. [23) are set forth - The approaches 
are non rigorous, mostly based on replica computations, with the exception of |2U| whose 
method is the real space renormalization technique for one-dimensional disordered sys- 
tems first proposed in in the context of quantum Ising model with transverse field and 
then applied with remarkably precise results to random walk in random environment, see 
e. g. .17,. The result in j^j, that h(-) < h c (-), is obtained by exploiting the path behavior 
of the copolymer near criticality suggested in [201 • This strategy may by summed up by: 
the localized polymer close to criticality is mostly delocalized in the upper half-plane and 
it keeps in the lower half-plane only the rare portions with an atypically negative charge. 
The numerical results that we set forth in this work are saying that this strategy is not 
good enough. 

At the opposite end, the result h(-) > h c (-), albeit relatively subtle, is absolutely ele- 
mentary to prove [5J. And such a bound does not depend at all on the details of the walk: 
any non trivial null recurrent walk with increments in { — 1, 0, +1} leads to the same upper 
bound. This suggests that such a bound is too rough. One can however prove that the 
standard procedure for obtaining upper bounds that goes under the name of constrained 
annealing cannot improve such a bound [7j. This is in any case far from being a proof that 
h(-) = h c (-), and in fact the numerics suggest that this is not the case. 

In the literature one finds also a large number of numerical works on copolymers, we 
mention here for example [0], [2H] and references therein. As far as we have seen, the 
attention is often shifted toward different aspects, notably of course the issue of criti- 
cal exponents, and the more complex model in which the polymer is not directed but 
rather self-avoiding, see [0] and [23] also for some rigorous results and references in such 
a direction. 

Our work has been led rather by the idea that understanding the precise location of 
the critical curve is a measure of our understanding of the nature of the transition. Un- 
derstanding that, in turn, could promote an advance on the mathematical analysis of the 
copolymer and, more generally, of this kind of disordered models. 

2. A STATISTICAL TEST FOR THE LOCALIZED PHASE 

2.1. Checking localization at finite volume. At an intuitive level one is led to believe 
that, when the copolymer is localized, it should be possible to detect it by looking at 
the system before the infinite volume limit. This intuition is due to the fact that in the 
localized phase the length of each excursion is finite, therefore for N much larger that 
the typical excursion length one should already observe the localization phenomenon in a 
quantitative way. The system being disordered of course does not help, because it is more 
delicate to make sense of what typicality means in a non translation invariant set-up. 
However the translation invariance can be recovered by averaging and in fact it turns out 
to be rather easy to give a precise meaning to the intuitive idea we have just mentioned. 
The key word here is super-additivity of the averaged free energy. 
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In fact by considering only the S trajectories such that S2N = and by applying the 
Markov property of S one directly verifies that for any N, M £ N 

Z2N+2M,u(S>) > Z 2N,uj(0) Z 2Myd 2N UJ (0), (2.1) 
(0u>) n = oj n+ \, and therefore 

{ElogZ^O)}^^ (2.2) 

is a super-additive sequence, which immediately entails the existence of the limit of 
E[log Z2jv jW (0)]/2iV and the fact that this limit coincides with the supremum of the se- 
quence. Therefore from the existence of the quenched free energy we have that 

f(A,/i) = sup— ElogZ 2A r )a ,(0). (2.3) 

N 2iV 

In a more suggestive way one may say that: 

(A, h)eC <=^ there exists iVeN such that Elog Z 2 n,U°) > • ( 2 - 4 ) 

The price one pays for working with a disordered system is precisely in taking the P- 
expectation and from the numerical viewpoint it is an heavy price: even with the most 
positive attitude one cannot expect to have access to ElogZ2jv jtlJ (0) by direct numerical 
computation for N above 10. Of course in principle small values of N may suffice (and 
they do in some cases, see Remark l2.1j) . but they do not suffice to tackle the specific issue 
we are interested in. We elaborate at length on this interesting issue in § 12.41 

Remark 2.1. An elementary application of the localization criterion (J2.4|) is obtained for 
N= 1: (X,h) £ C if 



E 



log ( - + - exp (-2A (wi + oj 2 + 2h)) 



> 0. (2.5) 



In the case P(wi = ±1) = 1/2 from (|2.5|) we obtain that for A sufficiently large h c (X) > 
1 — c/A, with c = (1/4) log(2exp(4) — 1) ~ 1.17. From h(-) we obtain the same type of 
bound, with c = (3/4) log 2 ~ 0.52. This may raise some hope that for A large an explicit, 
possibly computer assisted, computation for small values of of E log Z2N,u(fy could lead 
to new estimates. This is not the case, as we show in § 12.41 

2.2. Testing by using concentration. In order to decide whether E log Z 2 iV,<j (0) > 

we resort to a Montecarlo evaluation of E log Z 2 at,u>(0) that can be cast into a statistical 
test with explicit error bound by means of concentration of measure ideas. This procedure 
is absolutely general, but we have to choose a set-up for the computations and we take 
the simplest: P(wi = +1) = P(cji = —1) = 1/2. The reason for this choice is twofold: 

• if oj\ is a bounded random variable, a Gaussian concentration inequality holds 
and if u is symmetric and it takes only two values then one can improve on the 
explicit constant in such an inequality. This speeds up in a non negligible way the 
computations; 

• generating true randomness is out of reach, but playing head and tail is certainly 
the most elementary case in such a far reaching task (the random numbers issue 
is briefly discussed in Appendix lAl too) . 

A third reason to restrict testing to the Bernoulli case is explained at the end of the 
caption of Table [21 
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We start the testing procedure by stating the null hypothesis: 

HO: ElogZ 2N JO)<0. 



(2.6) 



iV in HO can be chosen arbitrarily. We stress that refusing HO implies Elog Z2N,u(fy > 0, 
which by ()2.4j) implies localization. 

The following concentration inequality for Lipschitz functions holds for the uniform 
measure on {— 1, +1} : for every function Gn '■ {—1,+1} N — ► M such that \Gn(u) — 
Gn(w')\ < CLipV^S^LiC^n - w[,) 2 ), where Cup a positive constant and Gn(uj) is an 
abuse of notation for Gn(uji, . . . one has 

E [exp (a (G N (u) - E[G N (u)]))] < exp (a 2 C7^ ip ) , (2.7) 

for every a. Inequality (|2.7|) with an extra factor 4 at the exponent can be extracted from 
the proof of Theorem 5.9, page 100 in ^Sj. Such an inequality holds for variables taking 
values in [—1, 1]: the factor 4 can be removed for the particular case we are considering 
(see ^2 p. 110-111]). In our case G/v(u;) = log ^2iV,u>(0)- By applying the Cauchy-Schwarz 
inequality one obtains that Gn is Lipschitz with Cu P = 2XyN. Let us now consider an 

G$(w) = G N (u): if I 
n G N, u > and a = un/8X 2 N 



IID sequence {G$(ui)}i with G$ (uj) = Gat(cj): if HO holds then we have that for every 



1 



£)G$(o;)>u) < E 



exp 



< exp 



exp 



n 

2 \2 ; 



(G N (u>) - E[G N {u)]) exp (-a (u - E[G N (u)})) 



Aa z \ z N 



au 



n 



u 2 n \ 
\Q\ 2 N ) 



(2- 



Let us sum up what we have obtained: 



Proposition 2.2. Let us callu n the average of a sample of n independent realizations of 
log Z^N^iO). Ifu n >0 then we may refuse HO, and therefore (X,h) G C, with a level of 
error not larger than exp (— u 2 n/16A 2 iV) . 



2.3. Numerical tests. We report in Table ^ the most straightforward application of 
Proposition 12.21 obtained by a numerical computation of log Zn for a sample of n inde- 
pendent environments uj. We aim at seeing how far above h{-) one can go and still claim 
localization, keeping a reasonably small probability of error. 

Remark 2.3. One might be tempted to interpolate between the values in Table ^ or 
possibly to get results for small values of A in order to extend the result of the test to the 
slope of the critical curve in the origin. However the fact that h c (X) is strictly increasing 
does not help much in this direction and the same is true for the finer result, proven in 
4 , that h c (X) can be written as U(X)/\, U(-) a convex function. 
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A 


0.3 


0.6 


1 


h 


0.22 


0.41 


0.58 


p-value 


1.5 x 10~ o 


9.5 x 10~ a 


1.6 x 10~ 5 


h(X) 


0.195 


0.363 


0.530 


h(X) 


0.286 


0.495 


0.662 


N 


300000 


500000 


160000 


n 


225000 


330000 


970000 


C. I. 99% 


7.179 ± 0.050 


9.011 ±0.045 


7.643 ± 0.025 



Table 1. According to our numerical computations, the three pairs (A, h) are in C and 
this has been tested with the stated p-values (or probability /level of error). We report 
the values of h(X) and h{\) for reference. Of course in these tests there is quite a bit of 
freedom in the choice of n and N: notice that N enters in the evaluation of the p- value 
also because a larger value of N yields a larger value of ElogZjjv^O). I n the ^ as ^ une 
we report standard Gaussian 99% confidence intervals for ElogZ 2 ^ w (0). Of course the 
p-value under the Gaussian assumption turns out to be totally negligible. 



2.4. Improving on h{-) is uniformly hard. One can get much smaller p-values at little 
computational cost by choosing h just above h(X). As a matter of fact a natural choice 
is for example h = h^°' 67 \X) > h(X), recall (|1.8|) . for a set of values of A, and this is 

^ ^(0.67) 

part of the content of Table [2j in particular E log Z 2 ' N+ w (0) > with a probability of 
error smaller than 10 -5 for the values of A between 0.1 and 1. However we stress that for 
some of these A's we have a much smaller p-value, see the caption of Table EJ an d that 
the content of this table is much richer and it approaches also the question of whether or 
not a symbolic computation or some other form of computer assisted argument could lead 
to h c (X) > h(X) for some A, and therefore for A in an interval. Since such an argument 
would require N to be small, intuitively the hope resides in large values of A, recall also 
Remark 12. II It turns out that one needs in any case N larger than 700 in order to observe 
a localization phenomenon at /i( 0,67 )(A). We now give some details on the procedure that 
leads to Table H 



A 


0.05O) 


0.1 


0.2 


0.4 


0.6 


1 


2(*) 


4(**) 


8(**) 


N + 


750000 


190000 


40000 


9500 


4250 


1800 


900 


800 


800 


iV_ 


600000 


130000 


33000 


7500 


3650 


1550 


750 


700 


700 



Table 2. For a given A, both E log [ u (A) (0) > and E log [ u <A) (0) < with 
a probability of error smaller than 10 -5 (and in some cases much smaller than that). 
Instead for the two cases marked by a (*) the level of error is rather between 10 -2 and 
10 -3 . For large values of A, the two cases marked with (**), it becomes computationally 
expensive to reach small p-values. However, above A = 3 one observes that the values of 
Z2n,lj(0) essentially do not depend anymore on the value of A. This can be interpreted in 
terms of convergence to a limit (A — > oo) model, as it is explained in Remark 12.41 If we 
then make the hypothesis that this limit model sharply describes the copolymer along 
the curve (A, /i' m '(A)) for A sufficiently large and we apply the concentration inequality, 
then the given values of N+ and A''- are tested with a very small probability of error. 
Since the details of such a procedure are quite lengthy we do not report them here. We 
have constructed (partial) tables also for different laws of w, notably u>i ~ N(0, 1), and 
they turned out to yield larger, at times substantially larger, values of N±(X). 
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First and foremost, the concentration argument that leads to Proposition 12.21 is sym- 
metric and it works for deviations below the mean as well as above. So we can, in the very 
same way, test the null hypothesis E log Z^n,u (0) > and, possibly, refuse it if u n < 0, 
exactly with the same p-value as in Proposition ^. 21 Of course an important part of Propo- 
sition [221 was coming from the finite volume localization condition (|2.4jl : we do not have 
an analogous statement for delocalization (and we do not expect that there exists one). 
But, even if Elog Z2N,u(fy < does not imply delocalization, it says at least that it is 
pointless to try to prove localization by looking at a system of that size. 

In Tabled we show two values of the system size N, N + and iV_, for which, at a given A, 
one has that Elog Z2n+,lu(0) > and E log ^2Af_,w(0) < with a fixed probability of error 
(specified in the caption of the Table). It is then reasonable to guess that the transition 
from negative to positive values of Elog Z. ^(0) happens for N G (N-,N + ). There is no 
reason whatsoever to expect that Elog Zn,w(0) should be monotonic in N but according 
to our numerical result it is not unreasonable to expect that monotonicity should set in for 
N large or, at least, that for N < iV_ (respectively N > N + ) E log Z2N,u(fi) is definitely 
negative (respectively positive). 




0.05 



0.10 



0.20 



0.50 

A 



1.00 



2.00 



5.00 



Figure 2. A graphical representation of Table|5| The plot is log-log, and a A _c behavior 
is rather evident, c is about 2.08. This can be nicely interpreted in terms of the coarse 
graining technique in the proof of the weak interaction scaling limit of the free energy 
in [Sj: from that argument one extracts that if A is small the excursions that give a 
contribution to the free energy have typical length A -2 and that in the limit the polymer 
is just made up by this type of excursions. One therefore expects that it suffices a 
system of size N(\), with limA\,o A 2 A r (A) = +oo, to observe localization if m < h' c (0), 
h = h {m) (X) = mA(l + o(l)) and A is small. 



Remark 2.4. As pointed out in the caption of Table [21 from numerics one observes a 
very sharp convergence to a A independent behavior as A becomes large, along the line 
h = /i( m )(A). This is easily interpreted if one observes that M m )(A) = 1 - ((log 2) /2mA) + 
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0(exp(— 4mA)) so that 



Dm exp ^" 2A E ^ n + ^ = ex P f^T ^ An ) A„(i+^)=o}( 5 )- ( 2 - 9 ) 

This corresponds to the model where a positive charge never enters the lower half-plane 
and where the energy of a configuration is proportional to the number of negative charges 
in the lower half-plane. 



3. Lower bound strategies versus the true strategy 

3.1. An approach to lower bounds on the critical curve. In this section we give an 
outline of a new derivation of the lower bound 

h(X) < /i c (A), (3.1) 

with h(X) defined in (|1.7[) . The complete proof may be found in Appendix El The argu- 
ment takes inspiration from the ideas used in the proof of Proposition 3.1 in ^S] and, even 
if it is essentially the proof of [3] in disguise, in the sense that the selection of the random 
walk trajectories that are kept and whose energy contribution is evaluated does not differ 
too much (in a word: the strategy of the polymer is similar), it is however conceptually 
somewhat different and it will naturally lead to some considerations on the precise asymp- 
totic behavior of Zn,u in the delocalized phase and even in the localized phase close to 
criticality. 

The first step in our proof of (|3.1|) is a different way of looking at localization. For 
any fixed positive number C we introduce the stopping time (with respect to the natural 
filtration of the sequence {to n }) T c = T c,X ' h (u>) defined by 

T G,x > h (u) := mf{N G 2N : z££(0) > C} . (3.2) 

The key observation is that if E[T ] < oo for some C > 1, then the polymer is localized. 
Let us sketch a proof of this fact (for the details, see Proposition IB . 2|) : notice that by the 
very definition of T c we have Z T c^ jU (0) > C. Now the polymer that is in zero at T c (u>) is 

equivalent to the original polymer, with a translated environment and setting 

T 2 {uj) := T c (lo') we easily get Z Tl[Lo)+T , i[Lo) ^) > C 2 (we have put T^uj) := T c (lo)). 
Notice that the new environment uJ is still typical, since T c is a stopping time, so that T 2 is 
independent of T\ and has the same law. This procedure can be clearly iterated, yielding an 
IID sequence }t=i,2,... that gives the following lower bound on the partition function: 

^T 1 H+...+T n H,a,(0) > C n . (3.3) 
From this bound one easily obtains that 

K J ™ T 1 {uj) + ... + T n {uj) -E[T C ]' V ' 

where we have applied the strong law of large numbers, and localization follows since by 
hypothesis C > 1 and E[T C ] < oo. 

Remark 3.1. It turns out that also the reciprocal of the claim just proved holds true, 
that is the polymer is localized if and only if K[T C ] < oo, with an arbitrary choice of 
C > 1, see Proposition IB.2I In fact the case EfT"- 7 ] = oo may arise in two different ways: 
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(1) the variable T c is defective, ¥[T C = oo] > 0: in this case with positive probability 
{Zn,uj(0)}n is a bounded sequence, and derealization follows immediately; 

(2) the variable T is proper with infinite mean, F[T C = oo] = 0, E[T°] = oo: in this 
case we can still build a sequence {Ti(k>)}i=i,2,... defined as above and this time 
the lower bound (|3.3j) has subexponential growth. Moreover it can be shown that 
in this case the lower bound (|3.3|) gives the true free energy, cf. Lemma IB. 11 which 
therefore is zero, so that derealization follows also in this case. 

As a matter of fact, it is highly probable that in the interior of the delocalized phase 
Z]sf jU! (0) vanishes P(da;)-a.s. when N — ► oo and this would rule out the scenario © above, 
saying that for C > 1 the random variable T c must be either integrable or defective. We 
take up again this point in Sections |I] and [5J we feel that this issue is quite crucial in order 
to fully understand the delocalized phase of disordered models. 

Remark 3.2. Dealing directly with T c may be difficult. Notice however that if one finds 
a random time (by this we mean simply an integer-valued random variable) T = T{oj) 
such that 

Zt(u)A°) > C > 1 . with E P1 < 00 . ( 3 - 5 ) 
then localization follows. This is simply because this implies T c < T and hence E[T ] < 
oo. Therefore localization is equivalent to the condition log Zx(u>),u>{^) > f° r an integrable 
random time T(u): we would like to stress the analogy between this and the criterion for 
localization given in § 12.11 see (12.4(1 . 

Now we can turn to the core of our proof: we are going to show that for every (A, h) with 
h < h(X) we can build a random time T = T(u) that satisfies (|3.5j) . The construction of 
T is based on the idea that for h > if localization prevails is because of rare w-stretches 
that invite the polymer to spend time in the lower half-plane in spite of the action of h. 

The strategy we use consists in looking for (/-atypical stretches of length at least M G 
2N, where q < — h is the average charge of the stretch. Rephrased a bit more precisely, we 
are looking for the smallest n G 2N such that Yli=n-k+l u i/^ < 1 f° r some even integer 
k > M. It is well known that such a random variable grows, in the sense of Laplace, as 
exp(S(g)M) for M — > oo, where S(g) is the Cramer functional 

:=sup{a<j-logM(a)}. (3.6) 

One can also show without much effort that the length of such a stretch cannot be much 
longer than M. Otherwise stated, this is the familiar statement that the longest g-atypical 
sub-stretch of ui, . . . , lon is of typical length ~ log A r /S((/). So T[u) is for us the end-point 
of a (/-atypical stretch of length approximately (logT(w))/S(g): by looking for sufficiently 
long (/-atypical stretches we have always the freedom to choose T(u) ^> 1, in such a way 
that also logT(cj) <C T(oS) and this is helpful for the estimates. So let us bound Z T r w \ u 
from below by considering only the trajectories of the walk that stay in the upper half- 
plane up to the beginning of the (/-atypical stretch and that are negative in the stretch, 
coming back to zero at step T{u) (see Fig- El the polymer is cut at the first dashed vertical 
line). The contribution of these trajectories is easily evaluated: it is approximately 

For such an estimate we have used (|1.12|) and logT(w) -C T(u) both in writing the 
probability that the first return to zero of the walk is at the beginning of the (/-atypical 
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stretch and in neglecting the probability that the walk is negative inside the stretch. It is 
straightforward to see that if 

4A 4A 

-±h<~q-'£(q), (3.8) 

and if T(u) is large, then also the quantity in (|3.7|) is large. We can still optimize this 
procedure by choosing q (which must be sufficently negative, i.e. q < —h). By playing 
with (|3,6|) one sees that one can choose go £ ^ such that for q = q the right-hand side 
in (JSIHJ) equals log M(-4A/3) and if h < log M(-4A/3)/(4A/3) = h{\) then g < -h. This 
argument therefore is saying that there exists C > 1 such that 

Zr{ u )A°) ^ C > ( 3 - 9 ) 
for every uj. It only remains to show that E[T] < oo: this fact, together with a detailed 
proof of the argument just presented can be found in Appendix iBl 




Figure 3. Inequality 13.1011 comes simply from restricting the evaluation of Z t ^)+l,lo 
to the trajectories visiting the q-atypical stretch of length I and by staying away from 
the unfavorable solvent after that. 



3.2. Persistence of the effect of rare stretches. As pointed out in the previous sec- 
tion, there is strong evidence that h c (X) > h(X). At this stage Fig. |1] is of particular 
interest. Notice first of all that in spite of being substantially above h{-) the copolymer 
appears to be still localized, see in particular case A. 

The rigorous lower bounds that we are able to prove cannot establish localization in 
the region we are considering. All the same, notice that if one does not cut the polymer 
at T(lo), as in the argument above, but at T(lv) + L, a lower bound of the following type 

roughly 1 / logT(u;)\ 1 

Zt(u)+l*, > const. /2 exp(^-2A(g + fr) ^ j (3.10) 

is easily established. Of course we are being imprecise, but we just want to convey the idea, 
see also Fig. |3 that after passing through an atypically negative stretch of environment 
(q > 0), the effect of this stretch decays at most like L -1 / 2 , that is the probability that a 
walk stays positive for a time L. 

At this point we stress that the argument outlined in S 13. H and re-used for (|3.1UI) may be 
very well applied to h > h(X), except that this time it does not suffice for (|3,9|) . But it yields 
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nevertheless that for h £ (h(A),h(X)} the statement Z^^ ~ iV -1 / 2 , something a priori 
expected (for example jH]) in the delocalized regime and true for non disordered systems, 
is violated. More precisely, one can find a sequence of random times linx,- tj = oo 

such that Z T . W > Tj~ 1 / 2+a , a = a(\,h) > (see Proposition 4.1 in These random 

times are constructed exactly by looking for g-atypical stretches as above and one can 
appreciate such an irregular decay for example in case B of Fig. |IJ and this in spite of the 
fact that the data have been strongly coarse grained. 

Therefore the lower bound ()3.10j) . both in the localized and in the delocalized regime, 
yields the following picture: the lower bound we found on Zpf ttJ grows suddenly in cor- 
respondence of atypical stretches and after that it decays with an exponent 1/2, up to 
another atypical stretch. This matches Fig. |1J at least on a qualitative level, see the cap- 
tion of the figure. 

Of course it very natural to ask what is missing, on a theoretical level, to the strategy 
that we are adopting for the lower bound to match the quantitative discrepancy. Moreover, 
since the u sequence is of course known, one may look at the atypical stretches, this 
time defined by the points of sudden growth of Zn,u), and look for the specificity of such 
stretches. Up to now we have not been able to extract from this analysis definite answers. 

4. The delocalized phase: a path analysis 

Let us start with a qualitative observation: if we set the parameters (A, h) of the copoly- 
mer to (A,/i( m )(A)) with m = 0.9, then the observed behavior of {^^'^(0)}^ -suitably aver- 
aged over blocks in order to eliminate local fluctuations- is somewhat close to (const) /iV 3 / 2 . 
This is true for all the numerically accessible values of iV (up to N ~ 10 8 ), at once for a 
number of values of A and for a great number of typical environments uj. Of course this is 
suggesting that for m = 0.9 the curve h^ m \\) lies in the delocalized region, but it is not 
easy to convert this qualitative observation into a precise statement, because we do not 
have a rigorous finite-volume criterion to state that a point (A, h) belongs to the delocal- 
ized phase (the contrast with the localized phase, see (|2.4|) . is evident). In other words, we 
cannot exclude the possibility that the system is still localized but with a characteristic 
size much larger than the one we are observing. 

Nevertheless, the aim of this section is to give an empirical criterion, based on an 
analysis of the path behavior of the copolymer, that will allow us to provide some more 
quantitative argument in favor of the fact that the curve h^ m \\) lies in the delocalized 
region even for values of m < 1. This of course would entail that the upper bound /i(A) 
defined in (|1,7|) is not strict. 

4.1. Known and expected path behavior. We want to look at the whole profile 
{Z^^r (x)}xeZ rather than only at Z^r'^ r (0), where by oo r we mean the environment u 
in the backward direction, that is {uf) n := u>N + i^ n (the reason for this choice is explained 
in Remark 14. II below) . The link with the path behavior of the copolymer, namely the law 
of Sn under the polymer measure Pj^r , is given by 

7 X,h = F N,u J r(S N = x). (4.1) 

As already remarked in the introduction, although the localized and delocalized phases 
have been defined in terms of free energy they do correspond to sharply different path 
behaviors. In the localized phase it is known |241 [2] that the laws of Sn under uT are 
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tight, which means that the polymer is essentially at 0(1) distance from the x-axis. The 
situation is completely different in the (interior of the) delocalized phase, where one expects 
that Sjy = 0(y/~N): in fact the conjectured path behavior (motivated by the analogy with 
the known results for non disordered models, see in particular [2^, and [Hj) should 
be weak convergence under diffusive scaling to the Brownian meander process (that is 
Brownian motion conditioned to stay positive on the interval [0, 1], see |23j ). Therefore in 
the (interior of the) delocalized phase the law of Sn/^/N under Pj} uT should converge 
weakly to the corresponding marginal of the Brownian meander, whose law has density 
xexp(-x 2 /2)l( x > )- 

We stress however that for the delocalized regime the rigorous results that are available 
are more meager: essentially the only known P( dw)-a.s. result is that for any L > 

1 ^ 



1™ E A>- 

N^oo ' 



jy ^ ~(S n >L) 
n=l 



1 P(dw)-a.s., (4.2) 



that is the polymer spends almost all the time above any prefixed level. More precise results 
have been derived for the path behavior of the polymer under the quenched averaged 
measure EE J [•] : these results go in the direction of proving the conjectured scaling 
limit, but they still do not suffice (we refer to [T5. for more details and also for a discussion 
on what is still missing). 

In spite of the lack of precise rigorous results, the analysis we are going to describe is 
carried out under the hypothesis that, in the interior of the delocalized phase, the scaling 
limit towards Brownian meander holds true (as it will be seen, the numerical results provide 
a sort of a posteriori confirmation of this hypothesis). 

Remark 4.1. From a certain point of view attaching the environment backwards does 
not change too much the model: for example it is easy to check that if one replaces u> by 
uf in (jl.3|) . the limit still exists P(dw)-a.s. and in Li(dP). Therefore the free energy is 
the same, because {a£}i< n <jv has the same law as {uJ n }i< n <N, for any fixed N. 

However, if one focuses on the law of Sn as a function of N for a hxed environment 
u>, the behavior reveals to be much smoother under P^ r than under PjyL* For instance, 
under the original polymer measure P^y^, it is no more true that in the localized region 
the laws of Sn are tight (it is true only most of the time, see ^1] for details). The reason 
for this fact is to be sought in the presence of long atypical stretches in every typical u 
(this fact has been somewhat quantified in |15| Section 4] and it is at the heart of the 
approach in Section I2J that are encountered along the copolymer as N becomes larger. Of 
course the effect of these stretches is very much damped with the backward environment. 

A similar and opposite phenomenon takes place also in the delocalized phase. In fancier 
words, we could say that for fixed uj and as N increases, the way Sjv approaches its limiting 
behavior is faster when the environment is attached backwards: it is for this reason that 
we have chosen to work with P^ m*- • 

4.2. Observed path behavior: a numerical analysis. In view of the above considera- 
tions, we choose as a measure of the delocalization of the polymer the l\ distance AiJ (uj) 
between the numerically computed profile for a polymer of size 2N under P^^, and the 
conjectured asymptotic delocalized profile: 



7 X ' h \/2N 



<p+(x):=xe-* / 2 1 {X > 0) . (4.3) 
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Loosely speaking, when the parameters (A, h) are in the interior of the the delocalized 
region we expect A^r to decrease to as N increases, while this certainly will not happen 
if we are in the localized phase. 

The analysis has been carried out at A = 0.6: we recall that the lower and upper bound 
of (|1.7|) give respectively h(0.Q) ~ 0.36 and h(0.&) ~ 0.49, while the lower bound we 
derived with our test for localization is h = 0.41, see Tabled However, as observed in 
Section |21 Fig. 21 there is numerical evidence that h = 0.43 is still localized, and for this 
reason we have analyzed the values of h = 0.44, 0.45, 0.46, 0.47 (see below for an analysis 
on smaller values of h). 

For each couple (A, h) we have computed (cj) for the sizes N = a x 10 6 with 
a = 1,2,5,10 and for 500 independent environments. Of course some type of statistical 
analysis must be performed on the data in order to decide whether there is a decay of A 
or not. The most direct strategy would be to look at the sample mean of a family of IID 
variables distributed like A at (to), but it turns out that the fluctuations are too big to 
get reasonable confidence intervals for this quantity (in other words, the sample variance 
does not decrease fast enough), at least for the numerically accessible sample sizes. A 
more careful analysis shows that the variance is essentially due to a very small fraction of 
data that have large deviations from the mean, while the most of the data mass is quite 
concentrated. 

Remark 4.2. It is actually interesting to observe that the rare samples that affect the 
sample variances are in reality very close to meanders anyway, only with a smaller variance. 
This is the signature of the presence of atypical pinning stretches in the w-sequence close 
to the boundary. A fine analysis of this aspect would lead us too far and it is left for future 
investigation. 

We have therefore chosen to focus on the sample median rather than on the sample 
mean. Table 01 contains the results of the analysis (see also Fig. for a graphical repre- 
sentation): for each value of h we have reported the standard 95% confidence interval for 
the sample median (see Remark 14.31 below for details) for the four different values of N 
analyzed. While for h = 0.44 the situation is not clear, we see that for the values of h 
greater than 0.45 there are quantitative evidences for a decrease in A at: this leads us to 
the conjecture that the points (A, h) with A = 0.6 and h > 0.45 (equivalently, the points 
(A,/i( m )(A)) with m > 0.876) lie in the delocalized region. 



h\N(xW b ) 


1 


2 


5 


10 


0.44 


[.0603, .0729] 


[.0574, .0682] 


[.0572, .0689] 


[.0570, .0695] 


0.45 


[.0258, .0286] 


[.0207, .0232] 


[.0170, .0190] 


[.0149, .0171] 


0.46 


[.0140, .0154] 


[.0108, .0116] 


.00792, .00869] 


.00647, .00731] 


0.47 


[.00905, .00963] 


[.00676, .00711] 


[.00475, .00508] 


[.00364, .00398] 



Table 3. The table contains the standard 95% confidence interval for the median of a 
sample {A^ h (ui)}^ of size 500, where A = 0.6 and h, N take the different values reported 
in the table. For the values of h > 0.45 the decreasing behavior of Ajv is quite evident 
(the confidence intervals do not overlap), see also Fig. [S] 



As already remarked, these numerical observations cannot rule out the possibility that 
the system is indeed localized, but the system size is too small to see it. For instance, we 
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have seen that there are evidences for h = 0.43 to be localized (see case C of Fig. QJ. In 
any case, the exponential increasing of Zn(0) is detectable only at sizes of order ~ 10 8 , 
while for smaller system sizes (up to~ 10 7 ) the qualitative observed behavior of Zjy(0) 
is rather closer to (const) / TV 3 / 2 , thus apparently suggesting derealization (see case D of 
Fig.©. 

For this reason it is interesting to look at A^ 6 ' h for h = 0.42, 0.43 and for N -C 10 8 . For 
definiteness we have chosen N = ax 10 6 with a = 1, 2, 5, 10, performing the computations 
for 3000 independent environments: the results are reported in Table|U(see also Fig.EJ)- As 
one can see, this time there are clear evidences for an increasing behavior of A at. On the 
one hand this fact gives some more confidence on the data of Table 01 on the other hand it 
suggests that looking at {Atv}tv is a more reliable criterion for detecting (de) localization 
than looking at {Zn(Q)}n. 



h\N(xlO b ) 


1 


2 


5 


10 


0.42 




.351, 0.382] 




;.480, 0.517] 




.751, 0.794] 




1.01, 1.06] 


0.43 




.143, 0.155] 




.165, 0.180; 




.197, 0.215] 




.236, 0.264] 



Table 4. The table contains the standard 95% confidence interval for the median of a 
sample {A^ h (uj)}^ of size 3000, where A = 0.6 and h, N take the values reported in the 
table. For both values of h an increasing behavior of Ajv clearly emerges, see also Fig.|S] 
for a graphical representation. 



Remark 4.3. A confidence interval for the sample median can be obtained in the following 
general way (the steps below are performed under the assumption that the median is 
unique, which is, strictly speaking, not true in our case, but it will be clear that a finer 
analysis would not change the outcome). Let {Yk}i<k<n denote a sample of size n, that is 
the variables {lfc}fc are independent with a common distribution, whose median we denote 
by £1/2 : P {Yi < 6/2) = 1/2- Then the variable 

M n := #{i < n : Y { < £ 1/2 } (4.4) 

has a binomial distribution M n ~ B(n, 1/2) and when n is large (for us it will be at 
least 500) we can approximate N n /n 1/2 + Z/(2y/n), where Z ~ N(0, 1) is a standard 
gaussian. Let us denote the sample quantiles by H 9 , defined for q £ (0, 1) by 



#{i < n : Yi < E q } = [qn\ 



(4.5) 



If we set a := |$ 1 (0.025) | ($ being the standard gaussian distribution function) then the 
random interval 



1 — 1 a , 1 — 1I1 a 

2 2^n 2 ' 2^n 



(4.6) 



is a 95% confidence interval for £ 1( / 2 , indeed 



0.95 = P(Z G [-a, a]) 



, 1 1 
P - + — ^Z G 

2 



1 

L2 



+ 



2^/n ' 2 2^/n 



a 1 a 

' 2 + Vn 



2 2V" 



<£l/2 



< 



(4.7) 
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5. AN EMPIRICAL OBSERVATION ON THE CRITICAL CURVE 

The key point of this section is that, from a numerical viewpoint, h c {-) seems very close 
to /i( m )(-), for a suitable value of m. Of course any kind of statement in this direction 
requires first of all a procedure to estimate h c (-) and we explain this first. 

Our analysis is based on the following conjecture: 

(X,h)eT> =>■ lim Z^JO) = 0, F(du) - a.s.. (5.1) 

The arguments in Section|3](and in the Appendix) suggest the validity of such a conjecture, 
which is comforted by the numerical observation. Since, if (X,h) € C, Z^^iO) diverges 
(exponentially fast) P ( da;) -almost surely and since Z^^fo) is decreasing in h, we define 

^jv,o;(A) as the only h that solves -^^(O) = 1. We expect that hN^(X) converges to h c (X) 
as N tends to infinity, for typical w's. Of course setting the threshold to the value 1 is 
rather arbitrary, but it is somewhat suggested by (|2.4|) and by the idea behind the proof 
of JHJ} (Proposition IB. 21 and equation 

What we have observed numerically, see Figures El and may be summed up by the 
statement 

there exists m such that h^ tUJ (X) as h^ m \X). (5.2) 

Practically this means that /iat^(A), for a set of A ranging from 0.05 to 4, may be fitted 
with remarkable precision by the one parameter family of functions {h( m \-)} . The fitting 
value of Tti — : TTJjv ui does depend on N and it is essentially increasing. This is of course 
expected since localization requires a sufficiently large system (recall in particular Tabled 
and Fig. [2] - see the caption of Fig. for the fitting criterion). We stress that we are 
presenting results that have been obtained for one fixed sequence of u: based on what we 
have observed for example in Section 12.11 for different values of A one does expect that 
for smaller values of A one should use larger values of N, but changing N corresponds to 
selecting a longer, or shorter, stretch of u, that is a different sequence of charges and this 
may have a rather strong effect on the value of mjv^. Moreover there is the problem of 
deciding which A-dependence to choose. This may explain the deviations from Q5.2J1 that 
are observed for small values of A, but these are in any case rather moderate (see Fig. Q). 

A source of stronger (and unavoidable) deviations arises in the cases of unbounded 
charges: of course if 

h > h sat := max (-(w 2 n-i + w 2n )/2) , (5.3) 
ne{l,...,N} 

then Z^ (0) < 1, regardless of the value of A. Moreover it is immediate to verify that 

lim^oo Z^x^iO) = +oo for h < /i sat and therefore fiN, u (X) f /i sat as A f oo. We refer to 
the captions of Fig. for more on this saturation effect. 

We have tried also alternative definitions of h,N ttJ {X), namely: 

(1) the value of h such that = 1 (or a different fixed value); 

(2) the value of h such that the t\ distance between the distribution of the endpoint 
and the distribution of the meander, cf. Section^ is smaller than a fixed threshold, 
for example 0.05. 

What we have observed is that (|5.2|) still holds. What is not independent of the criterion 
is rriN,Lj- Of course believing deeply in (|5.2|) entails the expectation that rhj^^ converges 
to the non random quantity h' c (0). The results reported in this section suggest a value 
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of h' c (0) larger than 0.83 and the cases presented in Section 0] suggest that it should be 
smaller than 0.86. 



Appendix A. The algorithm for computing Z 



We are going to briefly illustrate the algorithm we used in the numerical computation 



of the partition function Zjy = . We recall its definition (see equation (|1.1|0 : 



J N,u> 



Zn = E 



exp 



N \ 
n=l J 



(A.l) 



where A n := (1 — sign(5 n ))/2 and the convention for sign(0) described in the introduction. 

Observe that a direct computation of Z^ from (jA.lj) would require to sum the con- 
tributions of 2 N random walk trajectories, making the problem numerically intractable. 
However, here we can make profitably use of the additivity of our Hamiltonian: loosely 
speaking, if we join together two (finite) random walk segments, the energy of the resulting 
path is the sum of the energies of the building segments. 

We can exploit this fact to derive a simple recurrence relation for the sequence of 
functions {Zm{u) '■= ^2A/(2y), y £ Z} MgN , where Z^(x) = Z^ u (x), the latter defined in 
and we recall that we work with even values of N. Conditioning on S2M and using 
the Markov property one easily finds 

[\Z M {y + l) + \Z M {y) + \Z M {y-l) 



>M+1 



(v) 



Z M (1)+Z M (0) 



+ 



\otM 



Z M {0)+Z_ 



Al { 



y>0 
y = 

y<0 



(A.2) 



\z M (y + i) + \z M {y) + \z M { y -l) 

where we have put au '■= exp ( — 2 A (u>2A/+i + W2A/+2 + 2/i)) . 

From equation (JA.2JI and from the trivial observation that Zu(y) = for \y\ > M, 

it follows that {Zm+i(,v), H £ 2} can be obtained from {Zj^^y), y £ Z} with O(M) 
computations. This means that we can compute Zn in 0(N 2 ) steps. 1 

We point out that sometimes one is satisfied with lower bounds on Zn, for instance in 
the statistical text for localization described in Section \2. II In this case the algorithm can 
be further speeded up by restricting the computation to a suitable set of random walk 
trajectories. In fact when the system size is N the polymer is at most at distance 0(y/N) 
(we recall the discussion in Section 0] on the path behavior) , hence a natural choice to 
get a lower bound on Zfq is to only take into account the contribution coming from those 
random walk paths {s n }neN for which 

-Ay/n <s n < Byjn for n > N , (A.3) 

where A, B, Nq are positive constants. Observe that this is easily implemented in the al- 
gorithm described above: it suffices to apply relation (|A.2|) only for y G [— Ay/M, By/M\, 
while setting Zm+i{v) = for the other values of y. In this way the number of computa- 
tions needed to obtain Zn is reduced to 0(N 3 ^ 2 ). 

The specific values of A, B, No we used in our numerical computations are 3, 8, 1000, 
and we would like to stress that the lower bound on Zn we got coincides up to the 8 th 
decimal digit with the true value obtained applying the complete algorithm. 

^The algorithm just described can be implemented in a standard way: the code we used, written in C, is 
available on the web page: http://www.proba. jussieu.fr/pageperso/giacomin/C/prog.html Graphic 
representations and standard statistical procedures have been performed with R 28 . 
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A final important remark is that for the results we have reported we have used the 
Mersenne-Twister ^H] pseudo-random number generator. However we have also tried 
other pseudo-random number generators and true randomness from www . random . org: 
the results appear not to depend on the generator. 

Appendix B. Proof of the lower bound on h c 

We are going to give a detailed proof of the lower bound (|3.1j) on the critical curve, 
together with some related result. We stress that this appendix can be made substantially 
lighter if one is interested only in the if part of Proposition IB. 21 In this case the first part 
of this appendix is already contained in the first part of § 13.11 up to ()3.4|) . and it suffices 
to look at § lB~2l 

We recall that Z^-'^(O) is the partition function corresponding to the polymer pinned 
at its right endpoint, see (tT"TTT) . and T c = T c {uj) is the first N for which Z N)U (0) > C, 
see (|3.2j) . In particular, for all to such that T c (lu) < oo we have 

Z TC^)>C. (B.l) 

We will also denote by T n := o-(u>i, . . . , uj n ) the natural filtration of the sequence {w n } ne N- 

B.l. A different look at (de) localization. We want to show that (de)localization can 
be read from T . We introduce some notation: given an increasing, 2N-valued sequence 
{£i}ieN> we set to := and Qn '■= max{&; : i& < N}. Then we define 

Z*, w (0) = 4^' A ' h (0) := E [ e -2AE^ =1 K+ft)A„. Stl =0,..., S kN =0,S N = 

Cn-i (B.2) 

= TT z ^ h , »■ (o) • z x > h tt (o) , 

and we recall that 9 denotes the translation on the environment. One sees immediately 
that Zn,uj(0) < Zn,oj(0)- We first establish a preliminary result. 

Lemma B.l. If the sequence {ti}i is such that Cn/N — » as N — > oo, then 

lim 1 log 4*^(0) = f(A, h) , (B.3) 

N—>oo iv 

both F(dw)-a.s. and in Li(P). 

Proof. By definition we have Zjy jUJ (0) > Zn^(0). On the other hand, we are going to show 
that 

4',t(0) < [JliU - ts_0 • (N - t <N ) j 4^' A ' h (0), (B.4) 

where A is a positive constant. To derive this bound, we resort to the equation (|1.13|) that 
expresses ^^^(0) in terms of random walk excursions. We recall that K(2n) is the discrete 
probability density of the first return time of the walk S to 0, and that K(t) > l/(At 3 ^ 2 ), 
t E 2N, for some positive constant A: it follows that for oi, • • • , at £ 2N 

K (a x + . . . + o fc ) < 1 < A k (ai • . . . • a fe ) 3/2 K{ ai ) • . . . • K{a k ) . (B.5) 

This gives us an upper bound to the entropic cost needed to split a random walk excursion 
of length (ai + . . . + a^) into k excursions of lengths a\, . . . , a^. 
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Now let us come back to the second line of (|1.13|) . that can be rewritten as 

Z N ,U0)= Yl G({ Xl }). (B.6) 

{ic i }C{0,...,A r }n2N 

A first observation is that if we restrict the above sum to the {xi} such that {xi} D {ti}, 
then we get zj^ l ^(0). Now for each {x{\ we aim at finding an upper bound on the term 
G({xi}) of the form c • G{{xi} U {ti}) for some c > not depending on {x^}. Each term 
G({xi}), see (jl,13[l . is the product of two terms: an entropic part depending on K(-) and 
an energetic part depending on </?(•). Replacing the entropic part costs no more than 

cent := A 2 ^ (f\(ti - U-!) ■ (N - t CN )) , (B.7) 



thanks to (|B,5|) . On the other hand, the cost for replacing the energetic part is easily 
bounded above by 



c, 



energy 



2 Cjv , (B. 



so that the bound G({x,i}) < c-G({xj}U{ij}) holds true with c := c ent c enevgy . Replacing in 
this way each term in the sum in the r.h.s. of (|B.6j) . we are left with a sum of terms G({yi}) 
corresponding to sets {yi} such that {yi} D {ti}- It remains to count the multiplicity of 
any such {yi}, that is how many original sets {xi} are such that {xi} U {U} = {yi}. Sets 
{xi} satisfying this last condition must differ only for a subset of {ti}, hence the sought 
multiplicity is 2^ N (the cardinality of the parts of {U}) and the bound (|B.4|) follows. 
Therefore we get 



\ogZ { ^ h (0) log4'>) 



N N 




< (21og2^l)^ + 3-lo R ( [[(I, /._,)• 



. /0 , r, a\Cn . „ CN + 1 , ( N 

< (21og2vl)— + 3 — — — log 



(B.9) 



N N °VCjv + 1. 

where in the second inequality we have made use of the elementary fact that once the sum 
of k positive numbers is fixed, their product is maximal when all the numbers coincide 
(for us k = Ov + 1). Since by hypothesis Cn/N — > as N — > oo, the Lemma is proved. □ 

Now we are ready to prove the characterization of C and T> in terms of T . Fix any 
C > 1. 

Proposition B.2. A point (X,h) is localized, that is h < h c (\), if and only i/E[T ] < oo. 

Proof. We set A := {uj : T c (uj) < oo}. Observe that for uj £ ^4^ we have Zn,uj{0) < C for 
every ./V G 2N, and consequently log Z^r'^(0)/iV — > as — > cxd. 

Consider first the case when the random variable T c is defective, that is P[^4^] > 
(this is a particular case of E[T^] = oo). Since we know that ]ogZ% n u (0)/N -» f(A,/i), 
P( do;)— a.s., from the preceding observation it follows that f(A, h) = and the Proposition 
is proved in this case. 

Therefore in the following we can assume that T c is proper, that is P(«4) = 1, so 
that equation (jB.l|) holds for almost every to. Setting 9~ l A := {to : 9u> £ A}, we have 
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since P is ^-invariant, and consequently P (h^qO k A) = 1, which amounts 
to saying that ()B.1|) can be actually strengthened to 

Z TC { e^J°) ^ C VA; > 0, P( do;)-a.s. . (B.10) 

Observe that the sequence {(9 tC ^ uj) n } n ^ has the same law as {w n } ne N and it is in- 
dependent of T T c. We can define inductively an increasing sequence of stopping times 
{T n } n6N by setting T := and T k+i (uj) - T k (u) := T c {6 Tk ^uj) =: S k {uj). We also set 
(n(u) := max{n : T n {uj) < N}. Since {Sfc}fc g N is an IID sequence, by the strong law of 
large numbers we have that, P(dw)-a.s., T n (u))/n — > E[T ] as n — > oo, and consequently 
( n (lj)/N -> 1/E[T C ] as N -> oo (with the convention that l/oo = 0). 

Now let us consider the lower bound Zj^^(0) corresponding to the sequence {ij} = 
{Ti(uj)}: from (lB~2|) and (|B~T0l) we get that P(dw)-a.s. 



Z 



(») = II ^W<.«» • Z T-T, MAR,™ 



i=0 



> (J<n(u) 



AT3/2 



where c is a positive constant (to estimate the last term we have used the lower bound 
Zk{0) > c/k 3 / 2 , cf. and consequently 

M r lQ g4',t(0) ^ ., . , log^gl H} ' A ' fe (0) ^ logC 

= ife, — iv— ^ fc* f * ipsf (R12) 

It follows that if E[T C ] < oo then f(A, h) > 0, that is (A, h) is localized. 

It remains to consider the case EfT*^] = oo, and we want to show that this time Zjq,w{fy, 
defined in (|B. 1 If) . gives a null free energy. In fact, as T c ' (rj) is defined as the first N such 
that Zjv,tj(0) > C, it follows that Z T c^ >rj (0) cannot be much greater than C. More 
precisely, one has that 

Z TC(r,),r,(0) < C exp(2A|?? T c w _ 1 + r] T c {v) \) , (B.13) 
and from the first line of ()B.11|) it follows that 

1 t ( \ 4- 1 r>\ Civ(^ ) 

^iogz^(o) < ^ + logc + - Y, (Knl + KH-al)- (B.i4) 

i=l 

We estimate the second term in the r.h.s. in the following way: 

Cjv(w) . AT 

AT' 



y £ {\ u Ti(w)\ + I^H-ll) - Jj]L 1 £ii:T i {u>)=k}(\Vk\ + l^k-l] 
' =1 fc=l 

^ hv E ^afcHH^} hv E (W + l^-il) (B.15) 



k=l / \ k=l 



1 * 



2 < A 



TV ^— ' - V iV 

fc=i 



Ca^(w) 
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for some positive constant A = A(u) and eventually as N —* oo, having used the Cauchy- 
Schwarz inequality and the law of large numbers for the sequence {|u;£;| 2 }fc e N- Therefore 



> g ^,.(0) < ^^logC + 4A^M, (B.16) 

and since E[T C ] = oo implies ( N {u)/N -> 0, P(dw) -a.s., we have log Zjv i(J (0)/iV —* 0, 
P( do;)— a.s.. Then Lemma IB. II allows us to conclude that f(A, h) = 0, and the proof of the 
Proposition is completed. □ 

B.2. Proof of the lower bound on h c . To prove equation (|3.1j) . we are going to build, for 
every (A, h) such that h < h(X), a random time T such that E[T] < oo and Z T '^ w (0) > C, 

for some C > 1. It follows that T c < T, yielding that E[T C ] < oo and by Proposition IB~2l 
(A, h) is localized, that is, h(X) < h c (X). 

Given M € 2N and q < —h, we start defining the stopping time 

TM (u) = T M , q (u) ■= inf in G 2N : 3k e 2N, k > M : ^«="- fc + 1 ^ < ff | . (B.17) 

This is the first instant at which a (/-atypical stretch of length at least M appears along 
the sequence u. The asymptotic behavior of tm is given by Theorem 3.2.1 in § 3.2] 
which says that P( do»)-a.s. 

logr M (w) w s ,„ 1Q v 
— >E(q) asM^oo, (B.18) 

where S(g) is Cramer's Large Deviations functional for u, ()3.6|) . We also give a name to 
the shortest of the terminal stretches in the definition of tm ■ 

/ spru u) . \ 

R M (u) = Rm, 9 (u) ■= inf Ik e 2N, k > M : 1= ™~ fc+1 - < g L (B.19) 

and it is not difficult to realize that Rm < 2M. 

We are ready to give a simple lower bound on the partition function of size TM,g (for 
any M £ 2N and q < —h): it suffices to consider the contribution of the trajectories that 
are negative in correspondence of the last (favorable) stretch of size Rm-, an d stay positive 
the rest of the time. Recalling that we use if (•) for the discrete density of the first return 
time to the origin and that by (|1.12|) we have K (2n) > c/n -3 / 2 for a constant c > 0, we 
estimate 

z HmJ°) >- A K{r M -R M ) K {Rm) e~^ h ) R ™ > e -^+h)M 

4 4r^ /2 (2M) 3 / 2 



3 

> d exp { -M 



-4A/% - ^ - (AX/3)h - ^ 



(B.20) 



where d := c 2 /^^) 

Having in mind (jB.18j) . we define a random index i = @A,e,q depending on the two 
parameters A £ 2N, e > and on q: 

£(u) = l A ,e, q {u) ■= inf Ik G 2N, k > A : < + A f ( B .21) 
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and we finally set T{uj) = T^ j£ig (a;) := 7gt u> -\(uS). Then for the partition function of size 
T(u) we get 

Z tUJ^ * d exp {|^[(-4A/3)9 - S(g) - (4A/3)fc - ^ - e] } . (B.22) 

The fact that E[7>i e 9 ] < oo for any choice of A, e, q (with q < —h) is proved in 
Lemma lB.31 below. It only remains to show that for every fixed (A, h) such that h < h(X), 
or equivalently 

(4A/3)/i < log M(-4A/3) , (B.23) 

the parameters A, e, q can be chosen such that the right-hand side of equation (|B.22|) is 
greater than 1. 

The key point is the choice of q. Note that the generating function M(-) is smooth, since 
finite on the whole real line. Moreover for all A E M there exists some go S M such that 

log M(-4A/3) = (-4A/% - S(?o) , (B.24) 

and from ()B.23|) it follows that qo < —h. Therefore we can take q = qo, and equation 
l|B.22|) becomes 



Z tU,J°) >c'exp|^ 



log M(-4A/3) - {4X/3)h - - e 



(B.25) 



It is now clear that for every (A, h), such that (|B.23|) holds, by choosing e sufficiently small 
and A sufficiently large, the right-hand side of (|B.25|) is greater than 1, and the proof of 
()3.1|) is complete. 

Lemma B.3. For every A £ 2N ; e > and q < —h the random variable T(tu) = TA,e,q{w) 
defined below ()B.21|) is integrable: E[T] < oo. 



Proof. By the definition ()B.21|) of I = iA,e,q we have 

T AtEtq < exp ((£(<?) + e) l A ^ q ) , (B.26) 

so it suffices to show that for any /3 > the random variable exp(/3 &A,e,q) is integrable. 
For any I S 2N, we introduce the IID sequence of random variables {^} n gN defined by 

-. nl 

Y l n :=- «*■ (B.27) 

i=(n,-l)i+l 

By Cramer's Theorem |l()j we have that for any fixed q < and e > there exists Iq such 
that P (Yjf < q) > e -^ s ('?)+ £ / 2 ) for every I > l . By (jTOTTl have that 

[M/l] 

{£>1}Q {ti > exp((S(g) + e)0} C f| {F/ > q} , (B.28) 

i=l 



with M := exp((£(g) + e)l), so that 

<r- «™ f-\ Mll\.-mi)+e/2)\ 

(B.29) 

< exp (— exp (te/4)) , 
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where the last step holds if I is sufficiently large (we have also used 1 — x < e x ). Therefore 

P (exp(/3 1) > N ) = P (£ > (log N) / f3) < exp (-iV £ / 4 ^ , (B.30) 
when N is large, and the proof is complete. 

□ 
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Figure 4. For A = 0.6 (ft(0.6) ~ 0.36 and ft(0.6) ~ 0.49) , the behavior of log Z 2 n,u, for 
h = 0.42 (A), 0.43 (C,D) and 0.44 (B). The sequence of charges is the same in all the 
cases. In case A, the polymer is localized with free energy approximately 3 • 10~ 6 : the 
linear growth is quite clear, but a closer look shows sudden jumps, which correspond to 
atypically negative stretches of charges. Getting closer to the critical point, case C, the 
growth is still rather evident, but it is clearly the result of sudden growths followed by slow 
decays (approximately polynomial with exponent —1/2). Case B suggests derealization: 
a closer analysis reveals a decay of the type iV -1 / 2 , but sharp deviations are clearly 
visible and these deviations are in reality much larger, since in the graph we have plotted 
just one point every 10000. Case D is the zoom of the rectangle in the left corner of C. 
The similarity between B and D make clear that claiming derealization looking at the 
behaviour of the partition function is difficult. 
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Figure 5. Graphical representation of the data of Tables[3](on the right) and^](on the 
left). The plotted points are the sample medians against the sample size, the error bars 
correspond to the confidence intervals given in Tables and H 
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Figure 6. On the left the case of binary symmetric o>i and on the right the case of 
ui ~ N(0, 1), boths for N = 3.2 ■ 10 7 . The small circles represent the computed values: 
the errors on /ijv,^ (A) are negligible and the plotted points are at the centers of the circles. 
The continuous line is instead the curve h^ m \-). In the binary case m = 0.841 and it 
has been chosen by solving /i' m '(4) = ftjv, w (4). In the Gaussian case m = 0.802, the 
maximum of /ijv,u(A)/A for the plotted values of A(> 0). The rather different values of 
rriN,u) may be somewhat understood both by considering that these two curves have been 
obtained for a fixed realization of uj and by taking into account the remark at the end of 
the caption of Table [2] it appears that for Gaussian charges one needs longer systems in 
order to get closer to the values of m observed in the binary case (in particular: for the 
prolongation, with the same random number generator, of the Gaussian ui sample used 
here up to N = 5 • 10 7 one obtains mjv,^ = 0.812). 
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Figure 7. Relative errors rjv,w(A) := ^/i (m) (A) — ft.w, w (A)j /ftjv jCJ (A), for the value m = 

itin.ui explained in the caption of Fig.E]and for the cases of N = 2.5 • 10 (x dots), and 
N = 3.2 • 10 7 (+ dots). Notice that in the binary case the error is more important for 
small values of A (recall Table [5] and Fig. Instead for the Gaussian case there is a 
deviation both for small and large values of A: the deviation for large values is due to 
the saturation effect explained in the text. Given the fact that h sa ,t, cf. 15.31 . behaves 
almost surely and to leading order for N — » oo as v / IoglV one understand why the slow 
disappearing of the saturation effect has to be expected. In both graphs the dotted line 
above the axis is at level 0.01. The fitted values for mjv,o>) N = 2.5 • 10 5 , are 0.821 in the 
binary case and 0.778 in the Gaussian case. 
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